Using Text Classification to Predict the Gene Knockout Behaviour of S. Cerevisiae

نویسنده

  • Patrick Caldon
چکیده

A naive Bayes classifier was used to analyze gene behavior based on text data and presented as an entry for the 2002 KDD Cup, a data mining exercise to predict the behavior of the yeast S. Cerevisiae. The solution presented was based on the multinomial event model for text classification(McCallum & Nigam 1998) with a feature selection mechanism added. Despite this simple model, performance close to that of the best entries in the competition could be obtained, which were using more sophisticated techniques. It appears that seemingly minor effort in using prior knowledge to conflate the gene classes, as well as the previously described effectiveness of the naive Bayes method contributed to this success.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Arabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents

Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...

متن کامل

Therapeutic Efficacy Analysis of lncRNA NEAT1 Gene Knockout and Apoptosis Induction in Prostate Cancer Cell Line Using CRISPR/Cas9

Background and Objective: Long non-coding ribonucleic acid (lncRNA) has been identified as an important gene regulator and prognostic marker in various cancers. The present study aimed to investigate the effects of Nuclear Paraspeckle Assembly Transcript1 (NEAT1) gene knockout using Clustered Regularly Interspaced Short Palindromic Repeats-associated Protein 9 (CRISPR/Cas9) in PC-3 cell line. ...

متن کامل

Inhibitory Effect of Supernatant and Lysate of Saccharomyces cerevisiae on Expression of exoA Gene of Pseudomonas aeruginosa

Background and Aim: Pseudomonas aeruginosa is an important ubiquitous and especially common pathogen in the hospital. Exotoxin A that encoded by exoA gene has a role in pathogenesis of this bacterium. Today, probiotics are widely used in the treatment and prevention of diseases. The present study aimed to study the Saccharomyces cerevisiae S3 effect on the expression of exoA gene. Materials an...

متن کامل

Efficient Production of Biallelic RAG1 Knockout Mouse Embryonic Stem Cell Using CRISPR/Cas9

Background: Recombination Activating Genes (RAG) mutated embryonic stem cells are (ES) cells which are unable to perform V (D) J recombination. These cells can be used for generation of immunodeficient mouse. Creating biallelic mutations by CRISPR/Cas9 genome editing has emerged as a powerful technique to generate site-specific mutations in different sequences. Ob...

متن کامل

Modelling of Stress-Strain Behaviour of Clayey Soils Using Artificial Neural Network

In this research, behaviour of clayey soils under triaxial loading is studied using Neural Network. The models have been prepared to predict the stress-strain behaviour of remolded clays under undrained condition. The advantage of the model developed is that simple parameters such as physical characteristics of soils like water content, fine content, Atterberg limits and so on, are used to mode...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003